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Abstract 


Our research investigates how observations can be catego- 
rized by integrating a qualitative physical model with ex- 
periential knowledge. Our domain is diagnosis of patho- 
logic gait in humans, in which the observations are the gait 
motions, muscle activity during gait, and physical exam 
data, and the diagnostic hypotheses are the potential muscle 
weaknesses, muscle mistimings, and joint restrictions. Pa- 
tients with underlying neurological disorders typically have 
several malfunctions. Among the problems that need to 
be faced are: the ambiguity of the observations, the ambi- 
guity of the qualitative physical model, correspondence of 
the observations and hypotheses to the qualitative physi- 
cal model, the inherent uncertainty of experiential knowl- 
edge, and the combinatorics involved in forming compos- 
ite hypotheses. Our system divides the work so that the 
knowledge-based reasoning suggests which hypotheses ap- 
pear more likely than others, the qualitative physical model 
is used to determine which hypotheses explain which obser- 
vations, and another process combines these functionalities 
to construct a composite hypothesis based on explanatory 
power and plausibility. We speculate that the reasoning ar- 
chitecture of our system is generally applicable to complex 
domains in which a less-than-perfect physical model and 
less-than-perfect experiential knowledge need to be com- 
bined to perform diagnosis. 


The Promise of Deep Knowledge 

Recently in knowledge-based systems research, there has 
been an emphasis on “deep” knowledge over “shallow” 
knowledge. Deep knowledge is based on the causal mech- 
anisms underlying the domain, typically obtained through 
scientific studies and incorporated into physical models of 
the domain, while shallow knowledge is based on experi- 
ential knowledge, typically obtained from human experts 
and incorporated in rule-based systems [4], The promise of 
deep knowledge is that the conclusions of a knowledge-based 
system be inferred from an accurate physical model of the 
domain, rather than dependent on a time-consuming and 
error-prone knowledge engineering effort. 

The emphasis on deep knowledge has influenced re- 
search on diagnosis [9, 19]. In this line of research, the 
process of diagnosis is reduced to manipulating the physical 


model within the knowledge-based system, according to the 
following presumptions. 

1. Let M be the physical model which describes the do- 
main when everything is functioning as it should. 

2. Let M be the set of all physical models consistent 
with the observations. If A/ £ M, then there is a 
malfunction. 

3. If there is a malfunction, then for each M' 6 M, the 
differences between M' and M is a possible diagnosis. 

That is, the normal physical model is selectively changed 
until it predicts (or is compatible with) the aberrant obser- 
vations. Each change corresponds to an abnormality or mal- 
function. For example, in de Kleer and Williams’ method 
for diagnosis, such a change consists of suspending the con- 
straints of a component in the model, i.e., the outputs of 
a malfunctioning component are considered to be uncon- 
strained. Each set of changes that accounts for the obser- 
vations is considered a possible diagnosis. The process of 
diagnosis is a search for each such set of changes. 

An example might clarify the intended role of physical 
models. Consider the device in Figure 1, by now a familiar 
example in the literature. Mult-1, Mult-2, and Mult-3 are 
three multipliers, with A-E as their inputs and X-Z as their 
outputs. Add-1 and Add-2 are adders with X-Z as inputs 
and F and G as outputs. Given specific inputs to this de- 



Figure 1: Multiplier and Adder Example 
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vice, and assuming I lint the device is fiiiiclnming properly. 
the outputs can be predicted If the actual outputs are im- 
proper, malfunctions can be attributed to one or more com- 
ponents bv tracing through the connections and determining 
whether abnormal behavior would account for the outputs. 

For example, if output F is improper, then one possibility 
is that Mult-1 lias a malfunction because if Mult-1 was no 
longer behaving like a multiplier, then F would be affected. 

Further testing with different inputs can verify or rule out 
this hypothesis about Mult-1 [6. II;. 

Note that ill this example, experiential knowledge is 
not mentioned, nor is it needed. The model of the device— 
the behavior of the components and the interconnections 
between the components — provides sufficient knowledge for 
diagnosis to be efficiently performed. If models with simi- 
lar properties could be constructed for other domains, then 
the knowledge acquisition bottleneck could be avoided, and 
the conclusions of knowledge-based systems would lie better 
justified. 


Difficulties in Fulfilling the Promise 

We have been studying diagnosis in the domain of human 
pathologic gait (walking disorders) [13, 18). In this domain, 
the goal of diagnosis is to determine the muscular and skele- 
tal causes of the patient’s abnormal gait motions. Exam- 
ples of malfunctions include reduced ranges of motion of 
joints (contractures), joint pain, muscle weakness, and mus- 
cle spasticity. Typically, patients will have a known under- 
lying disorder such cerebral palsy or arthritis, which gives 
rise to the joint and muscle malfunctions. Diagnosing joint 
and muscle malfunctions is done to help determine what 
kind of treatments (e.g., physical therapy, braces, surgery 
on joints and muscles) will best correct the patient’s gait. 

The data is primarily of the following types: history, 
physical exam, and motion data. The patient’s history in- 
cludes information about past and present diagnoses and 
treatments and demographic data. The physical examina- 
tion provides data about the range of motion of joints and 
strength of muscles. The motion data include the gait mo- 
tions of the patient obtained through a special camera sys- 
tem. This measures joint angles, gait velocity, stride length, 
etc. Also, EMG measurements are taken while the patient 
is walking, which provides data on muscle activity. 

Since human walking is subject to the laws of Physics, 
just as any other physical activity, it appears that deep- 
knowledge-diagnosis would be appropriate. Unfortunately, 
there are several difficulties in doing diagnosis based on 
physical models. 
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. Ambiguity. Even if a domain model can be con- 
structed, there is a problem of obtaining sufficiently 
detailed data for the domain model. If a quantitative 
simulation is to be performed, then precise measure- 
ments of the initial state and input parameters need 
to be obtained. In many domains, this presents no 
difficulties. For example, electronic circuits are suffi- 
ciently constrained and well-understood so that care- 
fully selected measurements can give the state of the 
device. Unfortunately in gait analysis, many internal 
gait parameters cannot be directly or even indirectly 
measured by current technology, e.g., EMG data is a 
best a qualitative measure of muscle forces [20] . Gen- 
erally in medical domains, many internal parameters 
cannot be accurately measured without overly invasive 
actions. 

One answer to this problem (and part of our own 
solution) is the use of qualitative physical models 
[1, 8, 10, 15]. Such models still provide explanatory 
power in spite of ambiguous data. However, qualita- 
tive models introduce their own sources of ambiguity. 
As a rule, qualitative simulation does not predict a 
single sequence of states, but produces several alter- 
native state sequences. Additional information is re- 
quired to disambiguate between them [7], Also, qual- 
itative simulation might produce state sequences that 
are spurious [16]. 

3. Computational complexity. Diagnosis is inherently 
computationally complex. The number of possible di- 
agnoses is combinatorially large. If n different mal- 
functions can occur, then there are 2 n possible sets of 
malfunctions. If each malfunctions can be caused in m 
different ways, then there are 2 mn possibilities. This 
large hypothesis space is not just an abstract possibil- 
ity. In pathologic gait, patients with underlying neu- 
rological disorders typically have several malfunctions, 
some of which are “primary” (due to the underlying 
disorder), while others are attempted compensations. 

Clearly, there is a need to modify the assumption that 
physical models for performing efficient and accurate diag- 
nosis can be readily constructed in all domains. A more rea- 
sonable assumption is that physical models can perform spe- 
cific diagnostic subtasks. Experiential knowledge acquired 
from human experts is still needed to help guide the search 
through the hypothesis space. 

The Subtasks That Physical Mod- 
els and Experiential Knowledge 
Are Good For 


1. Construction. Domain models with sufficient predic- 
tive and explanatory power need to be constructed be- 
fore deep-knowledge-diagnosis can proceed. However, 
quantitative modeling of human gait is still a challeng- 
ing research topic [12]. This is not just a problem in 
our domain. More often than not, simulation of com- 
plex mechanical devices and biological processes are 
open research problems. 


The next question is to identify the respective roles that 
physical models and experiential knowledge can play. Un- 
fortunately, many factors are domain-dependent, and no 
“task model” is sufficiently developed to clearly answer this 
question (see [3, 5, 17] for what has been developed). Our 
own experience (the next best thing) is the following. 

As mentioned earlier, deep knowledge is usually associ- 
ated with physical models, which are intended to have pre- 
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dictive and explanatory power. A physical model of patho- 
logic gait needs to represent at least the muscles, the limb 
segments, and the interactions among them. Given a partic- 
ular situation (joint positions, limb and trunk momentums, 
muscle and ground forces), it ideally should predict changes 
in position and momentum. A major difficulty is obtain- 
ing accurate data on muscle forces. As a consequence, we 
will rely on a qualitative physical model, based on knowl- 
edge about the motions that muscles control and on relative 
strengths of muscles. For example, the action of a muscle 
on a joint might be described as “causes flexion” as opposed 
to a differential equation. Such models have weak predic- 
tive capabilities. In our domain, we will at best be able to 
explain how a motion could be caused by a combinations 
of factors, but the ambiguity concerning the exact amount 
of force associated with each factor precludes even qualita- 
tively accurate predictions. 

Another difficulty is using the physical model to search 
for diagnostic hypotheses. Because the interactions of the 
components (muscles and limb segments) are highly com- 
plex (unlike the device of Figure 1), a particular motion 
could be caused in a large number of ways, especially when 
combinations of malfunctions are considered. Also, the ef- 
fect of any single malfunction can propagate throughout the 
rest of the system. A sprained ankle, for instance, affects 
the whole gait, not just the motion of the ankle. A phys- 
ical model then might be able to suggest local, individual 
causes for a particular abnormal motion, but searching for 
all possible causes for each abnormal motion and generating 
composite hypotheses based on just this information will be 
computationally prohibitive. 

Can experiential knowledge be used for the diagnos- 
tic reasoning that is difficult to do with physical models? 
As is typical in knowledge-based systems, experience can 
provide rules that associate abnormal observations with the 
malfunctions that typically cause them. Hence, experiential 
knowledge can give valuable clues concerning what malfunc- 
tions should be considered. Such knowledge, though, is not 
very good for determining whether a hypothesized malfunc- 
tion accounts for the observations in a particular case, and 
is no good for considering combinations of interacting mal- 
functions. 

Thus, we can usually use (qualitative) physical models 
to suggest some of the possible causes of an observation and 
to determine what observations a hypothesis accounts for 
(explanatory coverage). Experiential knowledge can asso- 
ciate hypotheses with observations and suggest which hy- 
potheses appear more likely than others. In general, rea- 
soning based on experiential knowledge is good for generat- 
ing individual malfunctions that appear likely, while model- 
based reasoning is best for testing explanatory coverage of 
malfunctions and combinations thereof. Figure 2 summa- 
rizes these conclusions. 

Integrating Model-Based Reason- 
ing with Experiential Reasoning 


Experiential Reasoning 

Model- Based Reasoning 

generate: malfunctions 

generate: malfunctions 

associated with 

that cause 

observations 

observations 

test: likelihood of 

test: explanatory 

malfunctions 

coverage of 


malfunctions 

good for: generating 

good for: testing 

individual 

combinations 

malfunctions 

of malfunctions 


Figure 2: Experiential vs. Model-Based Reasoning 


reasoning with experiential reasoning. Hypothesis assembly 
is a general technique for constructing and critiquing com- 
posite hypotheses. It requests information via the following 
domain-dependent functions: 

1. a function that rates the “plausibility” of a hypothe- 
sis. The likelihood of malfunctions as determined by 
experiential reasoning can be used to rate plausibility. 
Explanatory coverage can be used to break ties (i.e., 
when the likelihoods of two malfunctions are too close 
for meaningful comparison). 

2. a function that rates the importance of observations. 
In pathologic gait, the amount of the difference be- 
tween abnormal and normal for a particular motion 
parameter determines its importance. 

3. a function that determines what hypotheses can ex- 
plain an observation. The qualitative physical model 
can be used to suggest hypotheses that explain an ob- 
servation. 

4. a function that determines what observations a (com- 
posite) hypothesis does and does not explain. The 
qualitative physical model can be used to determine 
what observations are explained by a combination of 
malfunctions. 

The subtasks that experiential knowledge and physical mod- 
els are good for fit into hypothesis assembly quite well. 

Hypothesis assembly uses this information to construct 
a composite hypothesis with the following properties: 

• The composite hypothesis explains as many obser- 
vations as possible in comparison with similar com- 
posite hypotheses. That is, no local change (addi- 
tion/deletion of some part to/from the composite hy- 
pothesis) improves explanatory coverage. 

• Each hypothesis part within the composite hypothesis 
is as plausible as possible, viz. in comparison to other 
hypothesis parts explaining some particular observa- 
tion. 


These considerations lead us to the following proposal based 
on using hypothesis assembly [14] to integrate model-based 
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The composite hypothesis is parsimonious, i.e., no hy- 
pothesis in the composite hypothesis is superfluous. 



(A hypothesis within a composite hypothesis is super- 
fluous if it can be removed without loss of explanatory 
coverage.) 

Hypothesis assembly also critiques this composite hypothe- 
sis in comparison to other composite hypotheses. Thus, one 
composite hypothesis is selected and its goodness in com- 
parison to other hypotheses is determined. 

Hypothesis assembly, however, is not guaranteed to find 
the “correct” hypotheses. Given the difficulties in deep- 
knowledge-diagnosis discussed earlier, no method can be 
expected to guarantee truth. Nor is hypothesis assembly 
guaranteed to produce the “best" hypotheses according to 
normative criteria such as “most probable hypothesis that 
accounts for all the observations.” Such criteria are com- 
putationally intractable (2j. We conjecture that hypothesis 
assembly is the best that can be done within the constraints 
of imperfect physical models and computational tractability. 


Conclusion 

It has been proposed that diagnosis should be based on 
physical models of the domain. However, several factors 
make it unlikely that diagnosis can be just be based on phys- 
ical models. These factors include constructing a sufficiently 
powerful physical model, obtaining sufficiently accurate ob- 
servations, and performing diagnosis efficiently. Diagnosis 
in the domain of human pathologic gait illustrates these 
problems. Our proposal is to integrate qualitative physical 
models with experiential knowledge so that both sources 
of information will be efficiently and effectively utilized. In 
particular, they can be integrated using the technique of hy- 
pothesis assembly, which constructs a composite diagnostic 
hypotheses with several desirable properties: explanatory 
power, plausibility, and parsimony. We speculate that the 
reasoning architecture of our system is generally applicable 
to complex domains in which a less-than-perfect physical 
model and less-than-perfect experiential knowledge need to 
be combined to perform diagnosis. 
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